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The cooperation and coordination in multi-robot systems is a popular topic 
in the field of robotics and artificial intelligence, thanks to its important role 
in solving problems that are better solved by several robots compared to a 
single robot. Cooperative hunting is one of the important problems that exist 
in many areas such as military and industry, requiring cooperation between 
robots in order to accomplish the hunting process effectively. This paper 
proposed a cooperative hunting strategy for a multi-robot system based on 
wolf swarm algorithm (WSA) and artificial potential field (APF) in order to 
hunt by several robots a dynamic target whose behavior is unexpected. The 
formation of the robots within the multi-robot system contains three types of 


Mobile robots roles: the leader, the follower, and the antagonist. Each role is characterized 
; g 
Multi-robot systems by a different cognitive behavior. The robots arrive at the hunting point 
Optimization accurately and rapidly while avoiding static and dynamic obstacles through 
Path planning the artificial potential field algorithm to hunt the moving target. Simulation 
results are given in this paper to demonstrate the validity and the 
effectiveness of the proposed strategy. 
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1. INTRODUCTION 

The cooperative hunting problem is an ideal problem for the research on multi-robot cooperation 
and coordination [1]-[3]. These robots work together to hunt a single target or a group of targets. Due to 
several factors, including uncertainty, target’s velocity, target’s behavior, lack of information, and the nature 
of the search space, it is difficult to perform complex tasks such as hunting with a single robot [4]-[7]. 
Conversely, in order to compensate for the shortage and weaknesses of using a single robot for hunting, a 
multi-robot system which is composed of many robots is employed instead. Multi-robot hunting is one of the 
most important and complex challenges in the field of multi-robot cooperation due to the fact that it includes 
all multi-robot system subproblems such as task allocation, target localization, collaborative pursuing, 
obstacle and collisions avoidance. The research on multi-robot system cooperative hunting covers 
many disciplines and domain knowledge, such as optimization algorithms [8], real-time dynamic path 
planning [9]-[10], multi-robot coordination [11]-[13], planning and learning [14], and communication [15]. 

There are many methods to solve the problem of multi-robot cooperative hunting. These methods 
are classified into two categories. The first category is cooperative hunting strategies based on artificial 
intelligence algorithms that are being used to hunt the target whose location is previously known to 
hunters [16]. For the second category, the hunting methods are based on sensor information, where the 
location of the target is not predefined or the environment is unknown [17]. Both categories have some 
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disadvantages such as in the first category methods that mainly depend on the information provided to the 
hunters in advance about the location of the target, which is impractical when the target is constantly moving 
in a random way. Whereas, the second category methods are mainly depending on the information provided 
by the sensors, therefore this information is not always reliable and do not produce robust results. The 
information that the sensors detect about research space is used by robots to capture the target [18]. This 
information is analyzed in order to find out the location of the robots, track the target, and detect obstacles. 

At present, the hunting with multi-robot attracts the attention of many researchers, as it is both 
significant and complex. As a result of this, many researches have been done that propose solutions for the 
problems of multi-robot cooperative hunting methods. Yan Wang et al. [19] proposed an improved Voronoi 
graph self-organizing cooperative hunting algorithm for the problem that capturing random targets with a 
conventional distributed algorithm based on Voronoi graph has a low efficiency and a high standard 
deviation. In this algorithm, Hunters find their own targets by choosing the nearest midpoint of a typical 
Voronoi graph boundary, and form a hunter’s alliance for the same target. To hunt, each hunter in the group 
employs a goal Voronoi graph area minimization technique. In [20], a cooperative hunting method inspired 
by the behavior of biological societies is proposed which uses four parameters that are repulsion, attraction, 
orientation, and influence. While attempting to catch the prey, the hunter robots create a cooperative swarm, 
and changing the parameters allows for decentralized control of the swarm's formation and behavior. The 
researchers in [21]-[23] propose solutions to solve cooperative hunting with multi-robot based on 
reinforcement learning. 

In this paper, the cooperative hunting multi-robot system is studied when the target moves with an 
unknown behavior, and a hunting strategy based on wolf swarm algorithm and artificial potential field is 
proposed. This hunting strategy is geared towards solving the problems of target escaping from hunters and 
long hunting time. In order to capture a target which is constantly moving at different speeds and in different 
directions with the presence of obstacles within the search space, the hunter robots must have the ability to 
keep the hunting formation and self-adjust while interacting with the environment. Moreover, this hunting 
process is independent of dynamic constraints related to the robot model. The specific hunting process is as 
follows: firstly, with moving of the target, the robots behave with the hunting method for wolves’ swarm 
using the wolf swarm algorithm which is to enable robots to work under a certain configuration. In the wolf 
swarm algorithm, the multi-robot system contains three kinds of robots, the leader, the antagonist and the 
follower that are for preventing the target’s escape when they detect and catch it as a collaborative swarm. 
Robots take turns during the hunt, according to the position of the robot with the target. Each wolf (robot) has 
a specific cognitive behavior that enables it to work with the swarm, track, and hunt the prey. Secondly, the 
robots reach the desired point rapidly and safely through the artificial potential field which is an important 
algorithm for avoiding obstacles and collisions with other robots. The simulation results show that the 
proposed algorithm can accurately and quickly hunt the moving target with an unknown behavior in an 
environment that contains obstacles. The main contributions of this paper are summarized as follows: 
i) Design of the simulation model of efficient and robust cooperation and coordination of multi-robot systems 
for hunting algorithm; ii) This hunting method is proposed to find an optimal, accurate, and safe path to catch 
the dynamic target by avoiding obstacles; iii) This method relies on properties that enable the robots' 
movements to change in real time to rapidly adapt to changes in the pursuit; and iv) Reducing the resources 
required for the implementation of the hunting process based on the uncomplicated calculations in this 
proposed strategy, which must be calculated at the real time. The rest of the article is structured as follows: 
section 2 discusses some related works, section 3 introduces the basic theoretical knowledge of methods 
used, section 4, presents the proposed hunting strategy of dynamic target for multi-robot systems and how it 
solves the cooperative hunting problem, section 5 gives the simulation test and verifies the feasibility of the 
algorithm. In section 6, conclusion and future works are discussed. 


2. RELATED WORKS 

This section gives a quick overview of some of the recent methods proposed for solving the multi- 
robot hunting problem. In the multi-autonomous underwater vehicle (multi-AUV) hunting algorithm, a 
leader-follower formation algorithm [24] is proposed by Cao and Guo to solve the problems of hunting a 
target. This method is based on leader-follower formation. The process of hunting is composed of three 
phases. The initial phase is task assignment, then formation tracking phase, and finally the phase of target 
capture. With this proposed algorithm, all hunters arrive at the same time at the hunting locations, cutting 
down on hunting time and preventing the escape of the target. In our work, we added a new role in the swarm 
in addition to the leader and the follower in order to find the target quickly, and change the formation to 
adapt to the hunting situation, and prevent the target escaping. 

Cao and Xu [25] proposed a multi-AUV hunting algorithm based on dynamic prediction for the 
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trajectory of the moving target to increase the performance of target hunting of multi-AUV. At the beginning 
of the hunting process, the prediction of the possible position of the target is performed while the target is 
moving using sample points that are updated dynamically in a short delay by using the fitting of a 
polynomial. Then, to assign the intended hunting points for each AUV, the method of negotiation is used. 
This method uses a deep reinforcement learning (DRL) algorithm to help the AUVs to arrive rapidly at 
targeted hunting spots. This method has some limitations as hunting a target with intelligent behavior who 
can escape easily from the hunters. 

In [26] they proposed the adaptive bio-inspired neural network (ABNN) method based on ABNN 
which is addressed to solve repeated and inefficient cooperative hunting of many targets by multi-robot 
problem in a workspace containing big-size obstacles using the new shunting equation which allows to 
optimize the hunting performance of the targets by the multiple hunters. This method is designed to catch 
targets by predicting their paths while avoiding the big sized obstacles. Hamed and Hamlich [27] presented a 
new strategy based on the wolf swarm algorithm for hunting a target with unexpected behavior by means of a 
cooperative multi-robot system. This method which is inspired from wolf’s behavior contains three types of 
wolves helps to reduce the time needed to capture the dynamic target and prevent it escape. However, this 
method has some limitations such as it is used only to capture a single target, and it is not practical when the 
research space contains obstacles. 


3. THE PROPOSED STRATEGY 

Our idea came from both combining an improvised multi-robot cooperation strategy for hunting a 
dynamic target [26] and the artificial potential fields method [28]. The objective is to hunt a target with 
random behavior and velocity by several robots. This hunting process is based on a strategy inspired from 
wolf swarm algorithm [6] where all robots in this multi-robot system have a specific role. This role is 
changeable according to the hunt situation. Each robot has to follow the hunting strategy while avoiding 
collisions with obstacles and other robots. We will give a brief introduction on an improvised multi-robot 
cooperation strategy for hunting a dynamic target and the artificial potential field. 


3.1. Improvised multi-robot cooperation strategy for hunting a dynamic target 

This paper adopts this improvised multi-robot cooperation strategy for hunting a dynamic target 
which is an efficient method to solve this kind of multi-robot tasks. This method is adapted from the way 
which the wolves use for hunting with some modifications to the wolves' behavior. The robots in this method 
are divided in three roles, the leader wolf, the antagonist wolves and the follower wolves. The N robots use 
for hunting the odor concentration of the prey which is expressed with this fitness function Ø (R;, T), where 
T is the target vector position and r = (1.... N). The closer the robot is to the target, the greater the fitness 
function Ø (R, T) value. The function value Ø (R;, T) is independent of the number of dimensions of the 
search space(n) Ø: IR" > IR. 


3.1.1. The leader 

The leader wolf R! is supposed to have a great deal of knowledge about prey’s position, and being 
the closest one to the prey. This leader starts howling and all the follower wolves join their leader to chase 
the target. The roles exchange between wolves is as follows: if Ø (Rs, T) > Ø (R!, T) then the Robot R, 
becomes the new leader and the ex-leader becomes an antagonist. 


3.1.2. The antagonist 

The robots with the role of the antagonist wolf follow the method described below in order to track 
the target without following the leader. Within the swarm, the number of antagonists (A_wolves) is ranging 
between [(N-1)/(A +1) , (N-1)/A], where à = [1, N/2] is the antagonism proportion factor. The research area is 
determined from the position of the leader, because this leader is always closest to the prey. This wolf only 
searches in a specific angle. The measure of this angle which is between the x axis and the straight line from 
the antagonist and the leader, is calculated in the following way: 


1 a 
Ry- Rky 


O = 2artan (1) 


2 2 iat 
Je- Rkxt1) + (Ry- Rky) + R}- Rut] 


Rj and Ry are the coordinates of the antagonist wolf Rẹ with 1 < k < (A_wolves), and the angle © value is 
by radian. The value of this angle is converted according the (2) to adapt it with the algorithmic method. 


,_( 9,0€ [0,7] 
aS F +2n, © ¢ [0,7] (2) 
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Using the aforementioned angle measure, the antagonist wolf examines the odor of prey in certain positions. 
ts = pr) 
These temporary positions (ee » Riy ) are calculated by the (3). 
® 


= 2np 
>) a (fe) fp (*) 5 cos erg, 6) 
Rky © Rky By sin@=) 


The vector B is the seeking vector (Bı, B2) ‘which is expressed in the (8). The factor y is the amount of the 
global directions whither the antagonist Ry will search for the prey, and ọ is the factor for the advancing 
direction where: 


yx 6! yx6' 
ọ = on -l , on +n (4) 


The two integers l; and l2 are responsible for limiting the seeking area. Their values are in interval [1, y]. 
The antagonist Rx updates its position to a temporary position where it senses the highest value of the prey's 


odor concentration provided Ø(R¢ , T) > OW (Rx, T). The antagonist wins the leader position 
if Ø(RẸ, T) > ØRL, T). 


3.1.3. The follower 

In this swarm, there are N-A_wolves-1 followers. This hunting strategy defines two types of 
behavior for each follower. The follower wolf switches between the two behaviors according to the distance 
between this wolf and the leader Deony. The distance of convergence Deony is described as: 


1 1 i 
Peons ET min(xs) aO: min(ys) ) e (9) 


where ø = ]0, 1] is the convergence factor. (xs, ys) are the boundaries of the search space. The robot is in the 
summoning behavior when the distance between this follower and the leader is greater than Deonv. The 
follower wolf advances to the leader, according the (6): 


sja 
K a 7 K o T (=) P o RÉ, a 5 ©) 2 [RL- RE | % 
; = f : -1 
Ry 0+1) Ry 0) a Ry- Ry (D E2 [R}- Ry | 
The vector (€1, €2)' is the deviation vector where €1, €2 are random numbers in 0 < €), €2 < 1, andi is 
the iteration index. a is the advancing vector which is expressed in the (8). The follower wolf behaves with 
preying behavior when the distance between it and the leader is less than Deony. The follower with this 


behavior gradually encircles and grasps the prey. The update of follower’s position is expressed by the 
expression: 


RÊ (1 RE (i RL- RÊ (i 
( = ')-( : o Ciel pt yon) (7) 
Riy (i+1) Riy (1) Yo Ry- Rjy (1) T2 
where (ti, T2) is the encirclement vector, whereas 0 < 11, t2 < 2 — (i/max_Iteration), and y is the preying 


vector which is expressed in the (8). 
The vectors a, B and y are calculated as follows: 


a) _ 4 fy )= 1 ar — (max&s)- a 
n 3° (7 2" e o min(ys) / ` à (8) 
The scalar S is the step factor where S € ]0,1[. 


3.2. Artificial potential fields (APF) 

The path planning problem is to develop a strategy for determining a suitable path for the robot. In 
the case of a mobile robot, the robot should move in the workspace from the initial position to its target or 
final position. In this paper, we adopt the APF [28], [29] in order to plan the path, the robot will follow. In 
the same workspace which is a two-dimensional space, we have N robots. Each robot has coordinates that are 
noted by q (x, y). 
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In this method, we consider that the robot moves in a potential field of two virtual forces. The 
repulsive potential field Urep (q) is generated by the obstacles to help collision avoidance. The attractive 
potential field Uau (q) is generated by the target to attract the robot to its goal destination. The expression of 
the attractive parabolic potential field Uar (q) is calculated depending on the position of the robot and the 
target. 


Uatt(xy)= 5 EV - 8s)? + (y- 8)? (9) 


Where ôx , dy is the position of the target. The scalar & is the attracting factor where € > 0. 
The potential is greater when the robot is far from the target, and it decreases when the distance 
between the robot and the target decreases. The attractive force applied to the robot is calculated by (10.a): 
Fatt (%Y) = — VUatt (x,y) (10.a) 


By partially deriving the potential attractive equation to the axis x and y as follows: 


Ua (%Y) 
Fran y= —S—— (10.b) 
oUt E&Y) 
Fyatt Y= ar (10.c) 
The attractive force Fan equation on the x axis and y axis is expressed as follows: 
Fiat y) = —§ (x - ôx) (11.a) 
Fyar&y) = -$ O - ôy) (11.b) 


The expression of the repulsive potential field Urep (x,y) is calculated depending on the position of the robot 
and the obstacle. The other robots in the multi-robot system are also considered as obstacles. 


Urep, y) = Urepo (x,y) + Urepr (x,y) (12) 


Where Urepo (x,y) is the repulsive potential field generated by the obstacles. Urepr (x,y) is the repulsive 
potential field generated by the other robots. 
1 


d(qi,voj) po 
0, d(qi,voj)=po 


1 
Krep ( 


ee ), d(qi,voj)<po (13) 


1 
d(qi,ork) pr 
0, d(qi,ork)=pr 


l . 
Urepr(qi)=42 Krep ( ), d(qi,ork)<pr i 


Where d(qi,voj) is the Euclidean distance between the robot 1 (i= 1,2,....,N) and the obstacle j (j = 1,....,M), 
d(qi,ork) is the Euclidean distance between the robot i and k, po is the safety distance from the obstacle, pr is 
the safety distance from the other robot. 

The repulsive force applied to the robot is calculated by the (15.a): 


Frep (X,Y) = -VU rep (X,Y) (15.a) 


By partially deriving the potential repulsive equation to the axis x and y axis, the repulsive forces are 
expressed as follows: 


OUrepo (x,y) 


m (15.b) 


F repo &.y)= 


Hunting strategy for multi-robot based on wolf swarm algorithm ... (Oussama Hamed) 


164 m) ISSN: 2502-4752 


OUrepo (x,y) 


Fyrepo (x y)= dy (d 5 K9) 
OUre (XY) 
F cs EP 15. 
xrepr OGY) ôx ( 5 d) 
OUre (X,Y) 
F append UAE 15. 
prep OGY) dy ( e) 
The repulsive forces Frepo and Frepr equations on the x axis and y axis are expressed as follows: 
g ( ~) (vxoj-xi) dtai T 
Frepox(qi)=- TEE d(qi,vxoj) po/ d(qi,vxoj)3’ (alvas pa (16.a) 
0, d(qi,vxoj)=po 
1) (vyoj-yi) ar 

J Krep(7——-—) 2, daivyoj)< 

Frepoy (qi)=- ep d(qi,vyoj) po/ d(qi,vyoj)? (aivyoj)<po (16.b) 
0, d(qi,vyoj)>po 

ki ( 1 1 ) (vxrk-xi) iani 

Freprx(qi)=- 4 "°P (daioxrio pr) d(qisoxrks” STOP" (16.0) 
0, d(qi,oxrk)=pr 
1\ (vyrk-yi) ; 

. Kr - k)< 

rrepa) = {RP Ga pe) daioi Saito (16.4) 


0, d(qi,vyrk)=pr 


3.3. Proposed cooperative hunting strategy 

For hunting in multi-robot systems, one of the most important considerations is to avoid colliding 
with obstacles while maintaining the hunting strategy. In this part, we explain the hunting strategy which 
combines the artificial potential field and an improvised multi-robot cooperation strategy for hunting a 
dynamic target. In the presented strategy in the section (III.A), there is no consideration for any obstacle 
whether fixed or moving obstacles (other robots) Figure 1. The robots are looking for the target T in a 
workspace W that does not include any obstacle. 


Antagonist 
Target 


Target 


Leader s D 


Follower Follower ? 
y, Leader 


Follower 
Follower 


x x 


Figure 1. Hunters looking for the target Figure 2. Repulsive potential field generated by obstacles and robots 


During the hunt, there is a high possibility of a collision between robots. The workspace is not 
always free from obstacles. In the presented paper, static and moved obstacles are taken into consideration. 
By using the repulsive potential field Urep (q) which is generated by the obstacles and the other robots, the 
hunter robots avoid collision with any obstacle (Figure 2). 

In this method, the attractive potential field Uatt (q) is generated by a virtual target instead of the 
target. The virtual target’s position is calculated by the motion control ons (3), (6-7) used in the introduced 
hunting method. This method is based on dividing the path followed by the robot towards its target into many 
virtual target’s positions in order to guide the hunter to the prey through a safe path. In the Figure 3, the 
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follower advances towards the leader, and its next position is calculated with random parameters using (6) 
into area (€1 x al x a2 x €2). For each iteration, the robot calculates its next position using the appropriate 
equation. This next position is updated each iteration is the virtual target position. 

As shown in Figure 4 the robot's path towards the target is a set of virtual target’s position that the 
robot will pass through. The equations used to calculate the virtual target’s positions contain random 
parameters. Therefore, for the same situation, the robot can pass through different paths towards the target. 
When at least one virtual target’s position is located in a moving obstacle (Figure 5) or a static obstacle area 
(Figure 6) the path is modified in a way that the robot does not collide with the obstacle. As shown in Figures 
7 and 8, the robot tries to reach the virtual target due to the attractive force that it generates, but the repulsive 
force of the obstacles drives the robot away. 


© + 
3 
w 
a % 
eS 4 o m 
* 4 
2x &) R Pt 
, Virtual target position # vaalee pai ®© 
K Follower + Follower 
-H Leader + Leader 
@ Prey @ Prey 
Figure 3. Follower’s advancing process Figure 4. The path of the follower towards the leader 
towards the leader 
@ @ 
= 
+ 
* @ # » 3 
yy Virtual target position x * $% Virtual target position aR % 
© Follower BB Q Follower $ 
= Leader D == Leader @ 
@ Prey @ Prey 
Figure 5. The follower’s path intersects with Figure 6. The generated path passes through the 
another robot obstacle 
© © 
+ +> | 
J Virtual target position ' D JẸ Virtual target position i <x 
xis Follower ® Follower of 
= Leader te Leader l © 
@ Prey @ Prey 
Figure 7. The repulsive force generated by the Figure 8. The repulsive force generated by another 
obstacle prevents the robot to reach the position of robot prevents the robot to reach the position of 
the virtual target the virtual target 
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The robot’s motion control equations have been combined with the equations of the artificial 
potential field. The forces are calculated using values of the virtual target’s position. The attractive force for 
the virtual target when the robot is a follower is different when the robot is an antagonist (17.c). The same for 
the two behaviors of the follower. The attractive force for the virtual target when the follower robot is in 
summoning behavior (17.a) is different when the behavior of the follower robot is preying (17.b). 

In the (11.a) and (11.b) the value 6x and 6, are replaced with the values of the virtual target’s 
position according to the role and the behavior of the robot. 


Fay) = (E {Rf +o.E(R} — RAD. Ry -RÉ H-E- {Ry to. €2.( Ry — Ri). RY -R5 Pp (17.a) 


Fan(x,y) = (-§ (x = {Rix + y1- TCR, — Rid) & (Y = {Ri + y2 (Ry Ry) }) (17.b) 
Faly) = (- § (X— {Rix + Bi. cos (2)}), - & (y — {Riy + Bo. sin(=*)}) (17.0) 


The repulsive forces are always the same for both types of obstacles. The equations of the repulsive forces 
are expressed as: 


Frepo(qi)=Frepox(qi), Frepoy (qi) (18.a) 
Frepr(qi)=Freprx(qi), Frepry(qi) (18.b) 
Frepx(qi)= Frepox(qi)+ Freprx(qi) (18.c) 
Frepy(qi)= Frepoy(qi)+ Frepry(qi) (18.d) 


From the (18.c)-(18.d), if d(qi,voj) < po the distance between the robot and the obstacle is less than the safety 
distance po and d(qi,urk) < pr the distance between the robot and the other robots is less than the safety 
distance pr, then the equations of the repulsive forces will be expressed as: 


M N 
SER 1 1 (vxoj-xi) _ 1 1 (uxrk-xi) 
Frepx(qi)= ; i Krep (camp z) d(qi,vxoj)3 ; Krep (Cares a) d(qi,vxrk)’ (19.a) 


M as N : 
ca 1 1\ (vyoj-yi) 1 1\ (vyrk-yi) 
R wS Krep (aa pe) ta” Des Krep (aa pe) ata (120) 


Using this strategy, the robot will adjust its path, and avoid going to the virtual target’s position that was 
calculated, if this position would cause a collision between the robot and moving or static obstacles. This new 
path ensures that the robot reaches the target without hitting any obstacle. As shown in Figure 9, when the 
virtual target’s position is within or close to the obstacle d(qi,voj)<po or d(qi,vrk)<pr , the robot does not go 
to this position. 


= 


& The next position 
w Virtual target position 
® Follower 


d Leader 
@ Prey 


Figure 9. The robot adjusts its next position to avoid the obstacle 
4. RESULTS AND DISCUSSION 
In order to verify the effectiveness and the performance of the proposed strategy, the simulation 


experiment of a hunting process is given. This hunting process is performed by means of a set of robots, 
which will determine the path in order to capture the moving target. The target and the robots are within the 
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same search space which contains several obstacles. The simulation software is MATLAB. In this 
experiment, we assume that the two-dimensional workspace 20x20 contains several robots that are moving 
all the time (Table 1), and several static obstacles (Table 2). The objective here is to use the hunting strategy 
to plan the path which the robots will follow in order to capture the target without colliding with obstacles. 
Therefore, each robot take off from the starting point towards the endpoint which is fundamentally dependent 
on the behavior of the target. The target updates its position randomly and unpredictably (Figure 10), where 
Px and Py are the coordinates of the target: Px (i+1) = Px (i) + pl and Py (i+1) = Py (i) + u2. wl and u2 are 
random numbers, where - 1 < ul,u2 < 1. 

We start the simulation by initializing randomly the location of the robots and the target as shown in 
Table 1 and Figure 11. The multi robot system is consisted of four mobile robots, one of them is the 
antagonist (V) and the three others (X) are the followers and the leader, and in addition to the target (O). It is 
worth noting that the process of capturing the target only takes place when at least three robots encircle the prey. 

In this random situation, the objective of the robots is planning the path to the target which is always 
in motion at random velocities. The path of the robots has to be admissible. These robots have to avoid 
collision between them and with the obstacles. The simulation results are shown in the Figures 12 and 13. 
The robots hunt the target at the position [12.3395, 11.0244] as shown in Figure 14. The hunting process is 
successfully performed. 


Table 1. The initial positions of the robots and Table 2. The initial positions and the radius of 
target obstacles 
Parameter Initial Parameter Position Radius 
Target 9;11.5 Obstacle 1 (10,10) 0.7 
Robot 1 17.0606 ; 12.4411 Obstacle 2 (8,12) 0.7 
Robot 2 7.0190 ; 10.2650 Obstacle 3 (7,9) 0.7 
Robot 3 8.0362 ; 1.5193 Obstacle 4 (8, 14) 0.7 
Robot 4 4.7983 ; 2.4664 Obstacle 5 (15,14) 0.7 
Obstacle 6 (11,14) 0.7 
20 20 
18 18 
16 16 
14 * (*) 14 
12 * 12 
x 10 Sa * 5 10 R 
> O*>) = > 
8 ~ 8 
6 6 
4 4 
2 2 
0 o 
oO 2 4 6 8 10 12 14 16 18 20 o 2 4 6 8 10 12 14 16 18 20 
x axis x axis 
Figure 10. The path of the target during Figure 11. Initial position of the robot and 
hunting process the target in the workspace 
20 
18 F 
16 
u Cx x O 
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Figure 12. The path of each robot during hunting process 
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As shown in Figure 13, the evolution of the best cost function depends on the closest wolf to the 
target. This function increases and decreases in different intervals, and that is mainly due to the dynamic of 
the target, which sometimes moves away from the hunter robots. In the fifth iteration, the function decreased 
significantly, as a result of the target moving faster and encountering another hunter who became much 
closer to the prey. This sharp decrease, which helps to catch the prey quickly, is a result of the well- 
distributed hunters in addition to the presence of the antagonist. The value of the best cost function tends 
towards zero on the 16" iterations, where the robots hunt the target. 
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The trajectories of each robot are secured and eligible towards the target as shown in the Figures 15 
to 18. The coordination and cooperation between the robots are well organized. As shown in the Figure 19, 
the first robot is constantly approaching the target, and the noticeable decrease between the first and sixth 
repetition is because the target and the robot take opposite paths, which makes them approach each other, and 
in the 16" iteration it catches the prey. In Figure 20, the second robot makes a safe distance between it and 
the obstacles (ob1—ob6) in order to make the path more secure. We used large numbers in the x axis to make 
the progression of the distance in y axis more accurate. The robot's path is usually not straight, as random 
values used in the equations of motion control affect its velocity. 

Based on these experiments, the hunter robots were able to search and chase the target without 
deviating from their main path despite the presence of some small biases in their behaviors. This strategy 
ensured the continuous pursuit of the prey despite its random behavior and the presence of obstacles. The 
results of the simulation experiments in the hunting strategy based on WSA and APF show that the proposed 
approach can satisfy the cooperative hunting task for a dynamic target by multiple hunter robots. 
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Figure 18. The path of the robot 4 during 
hunting process 
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Figure 20. Distances between the robot 2 and the obstacles 


5. CONCLUSION 
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In this paper, we proposed a new cooperative hunting strategy for a multi-robot system based on 
wolf swarm algorithm and artificial potential field. This strategy is dedicated to hunt targets with random 
behavior. This cooperative hunting is performed by several robots that are divided into three roles: the leader, 
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the follower, and the antagonist. To avoid static or dynamic obstacles and reach the hunting point rapidly, the 
artificial potential field algorithm is used. 

In complex search spaces, when the target is moving in random and unpredictable patterns, the 
problem of continually escaping prey from hunter robots may occur in a multi-robot system. Moreover, they 
are vulnerable to disturbances and loss of prey trace due to random prey behavior and the presence of 
obstacles in the search space that can cause the hunters to lose formation. In this paper, we study how to 
improve the hunting process by increasing the decision control to make the robots avoid collisions with 
obstacles while keeping the hunting process. The algorithm proposed in this paper is verified by numerical 
simulation. From the results of the simulation, robots can find and capture prey within the search space faster 
and without letting the prey escape thanks to the cognitive behavior of hunters. In order to make the behavior 
and characteristics of the robot flexible and effective during the hunting process, we added the artificial 
potential field to cooperative hunting strategy. The simulation showed that the APF leads to an optimal safe 
path towards the target. However, in this paper, the hunting process is performed only for a single target. 
Therefore, in our future research, we will modify the proposed hunting strategy to make it applicable for 
hunting multi-dynamic targets. 
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